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1 Introduction 
1.1 

In the classical Black-Scholes model for financial markets, the stock price S(t) 
is modelled as a Geometric Brownian motion with the diffusion coefficient 
ll y/vS(t) v , where volatility v is assumed to be constant. This assumption is 
convenient for the option price "prediction" . Contrary to this assumption, the 
traders treat the volatility as a parameter that changes with time and whose 
future values have to be evaluated (predicted) for a given period of interest. In 
this connection, many researchers would rather interpret the volatility as a ran- 
dom process, v(t), and study, so-called, stochastic volatility models. It is natural 
to verify how the volatility v(t) changes in time for real stock prices and attempt 
to select a suitable stochastic volatility model. Traditionally it is proposed to 
apply Generalized Autoregressive Conditional Heteroscedasticity tracking algo- 
rithms (shortly GARCH, see, e.g. [T]-[5], HJ, 0, for tracking v{t) from 
the stock prices. It is known from J3| (p. 109) that GARCH algorithm operates 
satisfactory under relatively stable market conditions but fails when highly unan- 
ticipated events that lead to a significant structural change occur. Nevertheless, 
in many realistic settings, the simplest GARCH(p,q), p,q=l,2, algorithms are 
adequate for tracking volatilities even over long periods (see Bollerslev, Chou, 
and Kroner [Jj], pages 10 and 22). The main difficulty in implementation of 
GARCH comes from the multivariate minimization procedure of its parameters 
even for small values of p,q=l,2. 

In this paper, we propose a new approach for tuning the GARCH param- 
eters. Our approach uses ideas from Nonparametric Statistics combined with 
the Kalman-Bucy filter representation of a GARCH model. This representation 
enables us to select a GARCH model with only one parameter, that practically 
achieves tracking accuracy of GARCH(1,1). Moreover, the Kalman-Bucy ver- 
sion of GARCH(p,q) allows for a considerable simplification of the minimization 
procedure in the GARCH parameters. 



Let S(ti), i = 0, 1, . . . , n be the sample, with i,; — U-i =: A, of asset prices S(t), 
<t<T, from the Black-Scholes model (see 0, H) 



where B t is a Brownian motion, S(0) is the initial stock price, ji{t) and v(t) are 
strictly positive deterministic functions, respectively. Denote by 
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the observed heteroscedasticity. Since 
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By Ito's formula we find that 



J t%—\ J ti — i 

' yffiajdB.) 2 + ±( 0.5(2/*(«) - «( S ))ds) 2 (2) 

2 '** 



+ — y 0.5(2/i(a) - w(a))ds y ^Ms)dB s . 

The parameter A is usually small (for example, if the stock prices are measured 
once a day for three consecutive years, then A ~ 0.001). For sufficiently small 
A, the dominating term in Xi is ^(J^ 1 \J v(s)dB s ) 2 , with the mean 
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Vi-i = J v(s)ds 

and, under some smoothness assumptions, the error J* t ** i (u(s) — Vi_i)ds is 
sufficiently small and can be ignored. 

Following Bollerslev |3] and Engle ^U], the GARCH(p,q) provides estimates 
Vi of Vi by the recursion 

fij = + / , fl'jSi-j + X] a m X( i+1 )_ m , (3) 

j — 1 m— 1 

subject to some initial conditions, where parameters K, gi, . . . , g p , a±, . . . , a g , as 
well as p, g, have to be chosen with the help of minimizing the observed sum of 
squares (here n = -g-) 

1 n 

51, •••,S , p,<H, ...,Og) = ~y2(Xi - Vi-l) 2 . 

n i=l 

In contrast to ©, we propose an alternative tracking algorithm borrowed, 
from Khasminskii - Liptser, |16| . and Goldentayer - Liptser |12) . with the uni- 
variate minimizing parameter 7: 

W * = + -"i-l + ra 2(fc + l)/(2fc+3) ( X « - 

i>0') _ fiO") X ^(J+1) 1 fy ft \ 

"< - U i-l+ n W i-l + n (2(fe+l)-j)/(2fe+3) v*« "•-^ ( 4 ) 

j = l,...,fc-l 

ft(*0 _ftW I gfc(7) /y ft \ 

U i ' ^-1 + ?1 (fe+2)/(2fc+3) ~ 

subject to some initial conditions, where (ft (7), i = 0, 1, . . . , k are some prescribed 
functions and 7 is chosen to minimize 



n 

n ( 1 ) = ^(X i -v i ^f 



n 

i=i 
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A choice of the parameter k is imposed by the smoothness of v(t): k = 
when v(t) is Lipschitz continuous (with a global Lipschitz constant) while other 
positive values of k are used when v(t) has a bounded A:-th derivative. 

It should be noted that the estimator, given in Q), is also of GARCH type. 
In Sectional we give modification of (QJ which is compatible with GARCH(p,q). 

The proposed estimator admits a fast optimization procedure and enables to 
avoid local minima quite easily. Moreover, its GARCH(p,q) modifications are 
always stable and possess faster minimization than classical GARCH. 



2 Description of estimator. Quality of estima- 
tion 

2.1 Assumptions on preliminaries 

We assume that v(t) is a smooth function. If v(t) is a paths of random process we 
assume that this random process and the Brownian motion B(t) are independent. 

We use Nonparametric Statistics ideas for estimating a smooth function ob- 
served in the presence of white noise. Although we use some adaptive techniques, 
our method is different to that of Spokoiny and Mercurio ^7] , where the volatil- 
ity is approximated by a piecewise constant function. 

Assumption 2.1 The volatility v(t) is a strictly positive bounded function and 
belongs to the Ibragimov - Khasminskii - Stone subclass of functions (see, |14| . 
1 15) and |18j ) namely, k times differ entiable (k = included) with Lipschitz 
continuous k-th derivative. 

In the accordance with this assumption, there exists a positive number L such 
that for any i 

A i+fe 

\v(U)-v(U-i)\<Lj—-q V (5) 

Assumption 2.2 fi(t) is a positive and bounded function. 
Set 

1 [** 

Mi-i = ^ J ^{s)ds. (6) 

Introduce 

1 [ u 

ii = . = / \[vTs)dB s , i > 1 

and notice that then (£i)j>i forms an i.i.d. sequence of (0, 1)-Gaussian random 
variables. From J3J, it follows that 



Xi = 0.25(2/ii_i - A + 2 v /Au i _i(Ati-i - ^-O^ + v *-^?- 
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Denote 

rji = V /Av i _i(2^ i _i + + - 1), 

9i(A) — 0.25A(2^ij_i + ^-i) 2 . 

By Assumptions 12.11 and 12.21 #i(A) = 0(A) and (r?i)i>i forms a sequence of zero 
mean uncorrelated random variables with 

= A« i _ 1 (2/i i _i + f^i) 2 + 2v\_ x =: (7) 

with <t?'s the strictly positive and bounded numbers. Thus possesses the 
following structure: 

Xi = + ??i + fli(A). (8) 

2.2 A reductive model for Xi 

We replace by a simpler model: 

Xi = Vi-i + rn (9) 

For this model, it follows from Ibragimov, Khasminskii [TJ, J5| ( see also Stone 
|18|h that there exists a kernel type estimate Vi of Vi, generated by (Xj)i<i<„, 
such that for any i 

E{Vi - V t ) 2 < ( n -2(l+fc)/(2fc+3)) - ( 10) 

It is also known from Khasminskii and Liptser |16| that the rate in n, given in 
IjlUI) . remains valid for the on-line estimate obtained with the help of recurrent 
algorithm given below 



Vi 


= V t -1 


l~(i) , Qo /v - \ 

U 1 - 1 n 2(fe+l)/(2fc+3) ^ l Vl ~ 1 > 




— u i~i 


^O+i) | * / Y T 

+ n W i-l + n (2(*+l)-i)/(2*+3) \ A * ^ 


j 


= i,... 


1 




-w 
= «<-i 


+ n (fc+2)/(2fe+3) ^ l ^"iJ- 



(11) 



More exactly, the above-mentioned rate in n is preserved out of the boundary 
layer i > 0(n _1 /( 2fe+3 ) logn), resulting from uncertainty in the initial conditions 
for provided that (see |16j) 

Assumption 2.3 All roots of the characteristic polynomial 

p k (X, q) = A fe+1 + q X k + q 1 X k - 1 + ... + q k _ x X + q k (12) 
are different and have negative real parts. 
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2.3 Adaptive estimator design 

Out of the above-mentioned boundary layer i > 0(n _1 /( 2fc+3 Mogn) estimates 
(vi)i>x obey the following property (see, (frU|) '): 

hm" snpE( Vl - ^) 2 n 2(/c+i)/(2fc +3) < c(q)i (13) 

n — >oc v- 

where the supremum is taken over all Vi's satisfying Assumption 12. ll The pa- 
rameter C(q) depends on a filter gain q, the vector with entries qo,qi, ■ ■ ■ ,q k . 
So, preserving the rate in n, the asymptotic estimation accuracy depends on q 
chosen in the framework of Assumption 12. 3| It is clear that a direct minimiza- 
tion of C(q) in q may contradict Assumption ^. 31 Goldentayer and Liptser, [T2"] . 
proposed an approach, based on the Kalman-Bucy filtering theory, for minimiza- 
tion of C(q) while preserving Assumption 12.31 However, for this approach the 
assumption Etj 2 = a 2 was used. Although in the case considered here Erj 2 ^ a 2 , 
we shall still follow this methodology. For a known a 2 and a free parameter 7, 
set 

0=2. (14) 
a 

This parameter was introduced in 12. and plays a crucial role in creating the 
filtering gain q entries: 

9,(0) - Uoo$ 1/k+1 
9 i(0) = U m d 2 ' k+1 



q k (d) = U ok d k ' k+1 

where C/oj's are entries of the first column of U the positive definite matrix, which 
is the unique solution of the algebraic Riccati equation (* is the transposition 
symbol) aU + Ua* + B - U A* AU = with matrices 



A= (l 



of sizes 1 x (1 
H21 that 



0), = 

















,B = 






W 



k), (1 + k) X (1 + k, (1 + k) X 1 respectively. It is known from 



k 




t/01 


U02 


C/03 


C/04 





1 


NA 


NA 


NA 


NA 


1 


V2 


1 


NA 


NA 


NA 


2 


2 


2 


1 


NA 


NA 


3 


^4 + ^ 


2 + ^/2 


^4 + 78 


1 


NA 


4 


1 + 


3 + 


3 + V5 


l + v^ 


1 
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So in [T2]. we deal with the estimator 

1 tj ? ol//c+l 

vm = + -C\ w + n2(fc °: i)/(2fc+ 3) - suw) 

^ w = e w + ^ w + n SHl3) (*< - w) (15) 
i = i, . . . , fc - 1 

2.4 Global Adaptation 

We propose to use the estimator (|15H for tracking Uj's when the X^'s are defined 
in the accordance with ©, i.e. 

Xi = Vi-i+rn + OiiA). 

The univariate minimization with the help of $ guarantees Assumption 12.31 
However, since the variance of the noise is not constant and, moreover, un- 
known, an evaluation of C(q(i?)), as in would be difficult. Therefore, 
we follow GARCH-technique adaptive method (see, e.g., [2]) and evaluate 

Vn&) = |££=i(»i-i ~ 5i-iW) a via S„(0) - IELi (*< - ^-iW) 2 - We 
show that for A sufficiently small with probability close to one 

SnW) > S n (#") V n {d') > V n {d"). 

A crucial role in proving this implication plays the above-mentioned asymptotical 
estimate E(vi — Vi) 2 < 0(n~ 2 ( 1+ ' £ )/( 2fe+3 )) , n — > oo, which is valid not only for 
0i(A) = but also when 0j(A) = 0(A) (see Lemma lA~Tl in Appendix A). 

Theorem 2.1 For sufficiently large n and any d' ^ $" and any e > 

P(][SnO?') - S n [fi")\ - W) - V n (ti")}\ > e) 

<e- 2 0(n-( 4fe+5 )/( 2fc + 3 '). (16) 

Proof. Taking into account #j(A) = 0(A) = 0(n _1 ), we find that 

= V n (0) + - f> 2 + £ - (*)) 

n * — ' n * — ' 

i=l i=l 

2 n 

+ - £ K^-i - «<-i W) + Ofa -1 )]^ + 0(n- 2 ) 

7=1 
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and, therefore, 



[S„(0') - S n (ti")] - [V n (&) - F„(tf")] 



20(n" i ) 



1\ " 



E [(«s-riiW) - (<?"))] 

i=l 
„ n 

- E [($-!(*') - + 0(n -1 )]^ + 0(»~ 2 )- 



For notational convenience, set 



/ 2 

\ n 1=1 



+ ^E(^w- 5 -i( f ))+°^ 2 )) ■ 

i=l / 



The use of ( r i) — 3 r i provides 



r 2 



(a n 

< 3 + 0(0]%? 



n" 1 

v i=l 



r 2 



i=l / i=\ 

Applying obvious estimates 

fo(l), i^O^-Vw'^logn) 

~ |0( n -(2(fc+l)/(2fc+3)) ; j > ( n -l/« 2fc + 3 logn), 

we get the following upper bounds for rj, I = 1, 2, 3: 

r 2 < 0(0(0(0 + i £F=i E^,^) - ^-i(^")) 2 ), 
r| = 12^)^^^ (^_ lW _^_ l(r) )) 2 ) 
r 2 = 0(n- 2 ) < ( n -(4fc+5)/(2fc+3))_ 

Hence, with the help of Chebyshev's inequality we find that for sufficiently large 
n, any <&' ^ i?" and any e > the desired statement holds true. □ 

Remark 1 Theorem 12.11 enables a meaningful comparison between estimators 
corresponding to various values of For notational convenience, the filter for 
k = with the best parameter $ is called Filter 0. 
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3 Filters controlled by multiple parameters 



We restrict ourselves by consideration of GARCH(1,1) and GARCH(2,2) in the 
form of l|15(l . To distinguish these filters from classical GARCH's we denote 
them, by an analogy with Filter 0, by Filter 1 and Filter 2, respectively. The 
structure of these filters and the motivation for their applicability is given in 
Appendix IbI So, due to l|15H and l|29|l . we have 
Filter 1 

vi = vi-i (1-— + — + ( x i-Vi-i ; 17 
V n > n n z l i 



Filter 2 



v 



1 a(1) V2$, v 
M = f 1 - ^ - %U + (18) 



i — 1 1 I 

V n / n 

■IXi - Uj-ij. 



n 3/5 

It is assumed that < 01,02 < n and |i"T| <C n. The estimates generated by 
Filters 1 and 2 possess the optimal rate in n — > 00, while for fixed n the presence 
of additional parameters a%,K and ai,a2-ftT, respectively, enables slightly to 
improve (about 10%) the best value of 



S n (^,K,a 1 ,a 2 ) = — (X 4 - ^_i) 



»=i 

The main adaptive parameter remains 1?. A contribution of 01, K or ai, a2, if 
is not essential. This fact enables to simplify the tuning parameters procedure, 
partic- 
ularly to avoid local minima, in comparison with the standard tuning proce- 
dure for classical GARCH(1,1), GARCH(2,2) (see MATLAB GARCH Toolbox: 
http://www.mathworks.com/access/helpdesk/help/toolbox/garch/garch.shtml 



4 Computer implementation and simulations 

The volatility dynamics may differ widely between various types of assets. For 
example, the volatility changes for stocks and risky assets are too fast and the 
volatility values are relatively high. The composite indexes and exchange rates 
characterized by slow changes and smaller volatility values. This remark points 
out the difficultly of finding the best filter simultaneously for all assets. 

In simulations, we compare the results of Filter 0, Filter 1 and Filter 2, as well 
as the GARCH(1,1) and GARCH(2,2), provided by MATLAB. Though Filter 1 
and Filter 2 are equivalent to GARCH(1,1) and GARCH(2,2) respectively, the 
comparison of the numerical results show some advantage of Filters 1 and 2 due 
to different tuning procedures (see comment at the end of Section |2J • 
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4.1 Tuning procedure for Filters 1 and 2 



The univariate minimization process required for Filter is straightforward. For 
Filter 1 and Filter 2 we used unconstrained minimization, as given below. 

Filter 1 

1. Set ai = 0, if = and find ■&* = argmin^ S„(z9, 0, 0). 

2. FindA- = £E*=i*<. 

3. Find a\ — argmin ai S^i?*, if*, eta). 

4. Local minimization of S n (i}, if, a±) in vicinity of ($*, if*, a*). 
Filter 2 

1. Set ai = 0, a 2 = 0, if = and find tf* = argmin,, S n (ti, 0, 0, 0). 

2. Find if* = -Y\l ,1,. 

3. Find (a*, af) = argmin 0l 02 5 n (tf*, Jf*, ai, a 2 ). 

4. Local minimization of 5*„(t?, if, oi, a 2 ) in vicinity of ($*, if*, aj, a 2 ). 

The tuning procedures above consist in the univariate minimization over 
the computation of if and the minimization over ai,a 2 . These steps are 
supposed to provide some (i?*, if *, a*, a 2 ) in the vicinity on the minimum point, 
where the multidimensional minimization procedure is applied. 

The simulation results demonstrate that in the vicinity of ($*, if *, a*, a 2 ) 
the function 5 n ($, if, ai, a 2 ) behaves as a concave function and this property is 
preserved in a wide range around of (??*, if*, a*, af>)- Moreover, the minimum is 
not sharp, so that the minimization procedure does not require high 'resolution'. 
The corresponding marginal projections of 5 n (i?, if, oi, a 2 ) are given on Figures 
[U and 

4.2 Exchange rates 

The USD exchange rates we used for historical volatility estimation were taken 
for the period between Ol-Dec-01 and 18-Jan-04, e.g. n — 1466. Filter 1 and 
Filter 2 provide estimation error similar to GARCH(1,1) and GARCH(2,2). The 
tracking accuracy in terms of S n (i!}, if, oi, a 2 ) is given the Tabled 



Table 1: Average one-step prediction error for the exchange rates volatility 



Currency 


Filter type 


From 


To 


Garch(Ll) 


Garch(2,2) 


Filter 


Filter 1 


Filter 2 


AUD 


$ 


9.096e-006 


9.090e-006 


9.117e-006 


9.092e-006 


9.092e-006 


EUR 


$ 


5.857e-006 


5.848e-006 


5.869e-006 


5.856e-006 


5.856e-006 


NIS 


$ 


1.830e-006 


1.827e-006 


1.839e-006 


1.827e-006 


1.826e-006 


RUB 


$ 


5.026e-007 


4.915e-007 


5.026e-007 


4.935e-007 


4.880e-007 


YEN 


$ 


5.388e-006 


5.372e-006 


5.389e-006 


5.376e-006 


5.367e-006 
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Figure 1: Filter 1 coefficients behavior in the minima for IBM stock. 
4.3 Stocks 

We considered some stocks of large computer manufacturers and toy and en- 
tertainment companies. The information (adjusted close prices) for the period: 
24-Feb-99 to 28-Oct-03, n — 1176, was collected using Yahoo. The numerical 
results for S n ('d,K,ai,a2), corresponding the best tuning parameters, show se- 
rious differences in filters behavior (see Table [5J ■ A different quality of Filters 
1, 2 and GARCH(1,1), (2,2) is provided by different tuning procedures. Filter 1 
provides the best quality. 



Table 2: Average one-step prediction error for the filter 



Asset 


Filter type 


name 


Garch(l,l) 


Garch(2,2) 


Filter 


Filter 1 


Filter 2 


DIS 


4.309e-003 


4.314e-003 


4.329e-003 


4.284e-003 


4.285e-003 


HPQ 


3.741e-002 


3.771e-002 


3.615e-002 


3.608e-002 


3.608e-002 


IBM 


3.229e-003 


3.228e-003 


3.217e-003 


3.199e-003 


3.200e-003 


INTC 


1.232e-002 


1.230e-002 


1.235e-002 


1.223e-002 


1.232e-002 


MAT 


1.899e-002 


1.879e-002 


1.850e-002 


1.817e-002 


1.847e-002 


SUN 


5.144e-004 


5.131e-004 


5.143e-004 


5.135e-004 


5.138e-004 


TOY 


6.134e-003 


6.128e-003 


6.158e-003 


6.098e-003 


6.082e-003 
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Figure 2: Filter 2 coefficients behavior in the minima for IBM stock. 



4.4 Discussion of the numerical results 

The univariate minimization of Filter proves empirically to be very fast. The 
tuned parameter d* for Filter gives a hint for Filters 1 and 2 tuning procedures. 
The multivariate designs of Filters 1 and 2 provide slightly better tracking accu- 
racy than GARCH(1,1) and (2,2), respectively, especially for stock options. We 
attribute this effect to difficulties in tuning procedure of the filter parameters, 
especially for GARCH(2,2), which leads to local minima more often than tuning 
of for Filter 1 and 2. 



5 Conclusions 

The volatility estimation designs introduced in this paper are similar to the 
widely used GARCH algorithms. The presented designs allow simple adaptation 
and filter structure preserving estimation accuracy similar to GARCH. The filter 
structure of the presented design allows performance accuracy evaluation and 
enforces the stability of the estimator. 
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Figure 3: Historical volatility estimation for exchange rates of USD vs EUR. 



A Auxiliary Lemma 

Lemma A.l The rate in n given in l|10|) is preserved in the presence o/0,(A) ^ 
in (O. 

Proof. Let Vi and Ui be two estimates created by (|15f) with and without 
nuisance parameter respectively. We prove below that 

E(d t ~u t ) 2 < 0(n- 2 ). (19) 

Since 

n 2(fc+l)/(2fc+3) 

>0, n — ► oo, 

the nuisance parameter Oi(A) does not change the rate l|10fl in n — * oo. 
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Figure 4: Historical volatility estimation for IBM stock. 

For the notational convenience write 6{, instead of #j(A), and set = 
Vi — Ui, 5^ = — vfft , j = 1, . . . , k. Then, we have 



5 l 


= Si-i 


, 1,(1) Uoo$ 1/k+1 (a s , 
n <_1 n 2(fe+i)/(2fc+3) 0% - x ) 






- #>! 


n i_1 n (2(fc+l)-j)/(2fc+3) V W « 


(20) 


3 


= 1... 


1 




4 k) 


= e\ 


+ n (fe+2)/(2fc+2) r« 0j -U 





subject to the initial conditions 5(0) = 0, 5^(0) =0, j = 1, . . . , k. 
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Set 



q n = 



/ f7 00l 9 1 /(l+fc) n -2(l+fc)/(2fe+3) ^ 

C/oi^ 2/(1+fe) n _(2(1+fe)_1)/((2 ' c+3) 

V [/ 0fe ^n-( 2 ( 1 + fc )- fc )/( 2fc + 3 ) J 
( Si \ 



> q = 



£/ 01 tf2/(l+fe) 



V #0*0 / 



and Fi = 



and recall that matrices a and A are defined in (|2.3|) . We 



rewrite (|2(J[) to the vector-matrix form 

Fi = F 4 _i + -aFi-! + q n 6i - <\ n AFi_x 
n 

where Fq = 0. Set G; = C n Fi, where C n is the diagonal (1 + fc) x (l + fc)-matrix: 



/ n (l+fe)/(2fe+3) q 



Cn — 



V 



o 



fc/(2fc+3) 






2/(2fc+3) 





o \ 




A/(2k+3) 



Then, Go = and 

Gi — Gi-i H — C„aFi_i + G n q n #, — G n q„AFi_i. 

71 

By directly verifying identities G„a = n^^aC n , C n q n = n -( 1 + fe )/( 2fc + 3 ) 
have 

iCnaFi-! =n- 2 ( 1+fc )/( 2fc+3 )aG 4 _ 1 



(21) 
q, we 

(22) 



G„q„^=n-( 1+fc )/( 2fe+3 )q^. 

The structure of matrix A provides n,( 1+k ^ ( 2k+1 *> A = AC n . Hence and from 
C„q„ = n -(i+fc)/(2fe+3) qj it holds 

C„q n AF i _ 1 = n- 2 ( 1+fc )/( 2fc+3 )qAG J _ 1 . (23) 

Gathering now (JHJ, (|22H . (|23() . we find the recurrent equation for G^'s: 

G, = G,_! + n - 2 (i+^)/( 2 fe+ 3 ) ( a _ qA)Gi_! + n-( 1+fc >/( 2fc+3 V*- 

With the matrix D n = I + n - 2 ( 1 +' c )/( 2fe + 3 ) ( a _ this recurrent equation is 
transformed into d = D n Gi-\ + n~( 1+fc) /( 2fc+3 )q#j. Hence, due to G = 0, we 
have Gi = Y? P =i £ , l +1 " p n" (1+fe)/(2 ' !+3) q^. The latter and |^| = O^- 1 ) provide 

OO OO 

\\Gi\\ < 0(n- 1 )n- ( - 1+k '>/ ( - 2k+ ^ Ml = 0(n-( 3fe+4 )/( 2fc + 3 ) £ 



p=o 



p=0 
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On the other hand, by Statement 2 in |16| . for some positive constants c Q and 
C and any p the following estimate for \\D'P\\ is valid: 

Ml < ^exp ( - Con -2 P (i+fc)/(2*+3)). 

Consequently, 

||G l ||<0(n-( 3fc + 4 )/( 2fe + 3 ))(l- e --"- 2(1+fc)/(2 ' t+3) ) _1 
= 0( n ^ (fc+2)/(2fc+3) ). 

Finally, by Fi = C^ 1 Gi, we find that 

|«y«)| < n -(l+fc-j)/(2*+3)|| G .|| < ( n -l+j/(2fe+3) ; j = 0,l,...,fc. 

Hence, ^ (0) < 0(n~ 2 ) and OH) holds true. □ 

B GARCH in the form of flUD) 

The filter of Q type was proposed in |lf>| for tracking of functions from the 
Ibragimov - Khasminskii - Stone (IKS) class (see, ^3]> D3 and CHI)- A further 
implementation of this filter compatible with IKS(k)-subclasses, k — 0, 1, . . ., 
was developed in |12j (function f G (IKS)(k) if it is k-times differ entiable having 
Lipschitz continuous k-th derivative). 

An analysis of GARCH filter structure enables us to claim that GARCH fil- 
ter is compatible even with further subclass of (IKS)(k)'s. For fixed k, function 
/ from the subclass of (IKS)(k) satisfies a differential inequality (see, Goldensh- 
luger and Nemirovski, |llp: 

\f {k) (t) + ai/f*-^ (t) + a 2 f^ k - 2 \t) + ■■■ + a fc _!/ (1 ) + a k f(t) + a k \<L, 

where oi, a%, ■ ■ ■ , dk are such that the roots of polynomial 

P(x) = x k + aix^ 1 + a 2 x k ~ 2 H h a k 

have negative real parts. 

By an analogy to Chow, Khasminskii and Liptser, [7], we propose the follow- 
ing estimator for f(t) and its derivatives f^'(t), j = 1, . . . , k via the observations 
of the process X t with X = and dX t = f(t)dt + edW t : 

df(t) = /W (t)dt + -^§^ (dX t - f(t)dt) , 
dp\t) = p^\t)dt + g2j/ g fc+3) (dX t - f(t)dt), j = 1, k - 1, 
d/W (t) = - {aj^ (t) + a 2 / (fe - 2) (t) + • • • + au-^ (t) + a k f(t) 
+ a k K)dt+ g (dX t -f(t)dt). 



1G 



Now we show that 



E(f(t)-f(t)) ^ xe 4fe/(2fe+3) 

E(f {j \t) - f u Ht)) 2 x ^(k-iVPfc+a), j = 1, ...,k. 



(25) 



Set A(t) =/(*)- /(t) and A^(t) = - p\t). From J21 and 

=/«+D ( t ), j = l,...,fc-l, 

/<*> (t) = - (d/'*" 1 ) (t) + a 2 /( fc - 2 ) (<)+■•■ + a fc _ x /« (t) + a fc /(t) + a fc ^) , 
we derive 

dA(t)=A^{t)dt- 

fe-i 



£ 2/(2 fe+ 3) (eiWi + A(t)dt), 

(edWt + A(t)di), j = 1, fc - 1, 



e 2j/(2fc+3) 

dA« (*)=-( X] «^ A(fc " £) (*) + «fe A (*) + «(*)) 



-^7&T5)(^ + A W d< )- 

Following [7], we introduce 

£ 2(fc+l)/(2fc+3) ' " ^ £ 2(fc+l-j)/(2fc+3) ' 7 

and notice that 

dtf(i) = - go*(t)]«ft - g dW t £ , 

d$W(t) - [5^(t) - q 3 6(t)]dt - q dW t e , j = 1, fc - 1, 

d<5«(t) - -(^ e M /( 2fc + 3 ) ,(j( fc -')(i) + £ 2fc /( 2fc+3 ) a ^(t)) 



(26) 



(27) 



M ( fe 2/(2fc+3)^ t _ qk5 ^ dt _ qkdW e 



where W£ 
Set 



l/(2fc + 3) W tE 2/(2k + 3) 



is the standard Wiener process. 



D(t) = 



V <5« J 



U(t) = 



^ u(te 2/(2fc+3) y 



/«o\ 
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and introduce matrices a £ and A of sizes (fc+l)x(fc+l) and lxfc + 1, respectively, 







1 
1 







y_ £ 2k/(2k+3) ak 



1 

. -£ 2 /(2fc+3) ai J 



and A = (l ... 0.) . With the help of the introduced matrices, we rewrite 
<|27l) into the vector-matrix form: 

dD(t) = (o e - QA)D(t)dt + U(t)dt - QdWf. 

It is known from [7] that the vector Q may be chosen such that the eigenvalues 
of the matrix clq — QA have negative real parts. This property is preserved for 
a £ — QA, at least for sufficiently small e. Henceforth, we assume this property 
for a e — QA too. Consider now the Lyapunov equation (here I is a unit matrix) 



(a E - QA)P £ + P e {a e - QA)* +1 = 0. 

It is clear that P e is the unique positive definite matrix and lim e ^o Pe 
where Pq is the unique matrix solving the Lyapunov equation 

(a - QA)P + P {a - QA)* +1 = 0. 



(28) 
= Po, 



Denote \\D(t)\\ 2 Pe (:= (D*(t)P e D(t))). With the help of the ltd formula and QgJ 
we find that 

d\\D(t)\\'k = (-WW + 2(D{t),P s U(t)) + (Q,P £ Q))dt 
+ 2(D{t),PeQdW t ). 

Therefore the function V(t) — E\\D(i)Wp is differentiable and 

V(t) = E(- \\D{t)\\ 2 + 2(D(t), P £ U(t)) + (Q, P e Q>) . 

It is obvious that for sufficiently small e positive constant ci, C2, C3, can be found 
such that E\\D(t)\\ 2 > c x V{t), 2E(D(t),P e U(t)) < c 2v /^), (Q,P e Q) < c 3 . 
Hence, V{t) < — aV(t)+C2y/V(t)+C3. The use of the inequality y/x < a~ 1 +ax, 
a > 0, x > 0, with a = ^— , provides 



2c 2 

V(t) < -0.5dV(t) + — 

Cl 



c 3 . 



Hence, for any t > we have V(t) < V(0) + 2c g+g ca . 
Obviously, this property and (|26|l provide (|25l) . 
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For the discrete time setting with yjt\ — ti-\ = e, adapted to the framework 
of ^B], we find that 

T _ f i j gQ (y _ ? \ 

Ji - Ti-l + ~ V i-l + n 2(fe+l)/(2fe+3) \ Ai h- 1 ) 

f(j) _ f{j) , j 91 / y- _ f \ 

J i ~ Jl - 1 ^ n Ji - 1 + n (2(fe+l)-j)/(2fc+3) V 2 

j = l,...,fc-l 

fc-1 



F = M (i--)--(e «#r° + + ^ 

V n J n \ *■ — ' 



n 
Qk 



n (fc+2)/(2fc+3) 

Finally, in the framework of this paper, we have 

Vi - Vi-x -f + o(k+l)/(2k+3) \ 



n - - n ,(fc+l)/(2fc+3) "i- 1 1 

j=l,...,fc-l 

fc-i 



W i - V i-1 + + (2(fc+l)-j)/(2fc+3) V^* 



(29) 



.(fc) Ak) fi °l\ 1 fV^ -(fc-f) , - . > - 

w i = v l-i y 1 - —) - - y aeV i-i + a kVi-i + a k K 



n / n 
C/ofc^ 



=2 

Xj - Vi-i) 



1 n (fc+2)/(2fc+3) 

References 

[1] Andersen, T., Bollcrslev, T., Diebold, F.X. and Labys, P. Exchange Rate 
Returns Standardized by Realized Volatility are (Nearly) Gaussian. , Multi- 
national Finance Journal , 4, 2000, pp. 159-179. 

[2] Baillie, R.T., T, Bollerslev, Prediction in Dynamic Models with Time- 
Dependent Conditional Variances, Journal of Econometrics, 52, 1992, pp. 
91-113. 

[3] Black, F. The Pricing of Commodity Contracts. Journal of Financial Eco- 
nomics, 9, 1976, pp. 167-179. 

[4] Black, F. and M. Scholes, The Pricing of Options and Coporate Liabilities, 
Journal of Political Economics, 81, 1973, pp. 637-659. 

[5] Bollerslev, T., Generalized Autoregressive Conditional Heteroskedasticity, 
Journal of Econometrics, 31, pp. 307-327, 1986. 

[6] Bollerslev, T., R.Y. Chou, K.F. Kroner, ARCH Modeling in Finance: A 
Review of the Theory and Empirical Evidence, Journal of Econometrics, 
52, pp. 5-59, 1992. 



19 



[7] Chow, P.L., Khasminskii, R. and Liptscr, R. (1997) Tracking of signal and 
its derivatives in the Gaussian White noise, Stoc. Proc. Appl. 69, 2, 259-273. 

[8] Day, T.E. and CM. Lewis . Forecasting Futures Market Volatility, Journal 
of Derivatives, Winter 1993. 

[9] Duan, J. C. The GARCH Option Pricing Model. Mathematical Finance. 5 
(1), 1995, pp. 13-32. 

[10] Engle, Robert, Autoregressive Conditional Heteroskedasticity with Esti- 
mates of the Variance of United Kingdom Inflation, Econometrica, 50, pp. 
987-1007, 1982. 

[11] Goldenshlugcr, A. and Nemirovski, A. Adaptive de-noising of signals sat- 
isfying differential inequalities. IEEE Transactions on Information Theory, 
43, 3, pp. 873-889, 1997 

[12] Goldentayer, L. and Liptscr, R. On-line tracking of a smooth regression 
function, Statistic Inference for Stochastic Processes, to apear in 2003. 

[13] Hamilton, J.D., Time Series Analysis, Princeton University Press, 1994. 

[14] Ibragimov, I. and Khasminskii, R., On nonparametric estimation of regres- 
sion, Soviet Math.Dokl.,21, 1980, pp. 810-814. 

[15] Ibragimov, I. and Khasminskii, R. Statistical estimation: Asymptotic the- 
ory. Springer Verlag, 1981 (Russian ed.1979). 

[16] Khasminskii, R. and Liptser, R. On-line estimation of a smooth regression 
function. Theory of Probability and its Applications. 3 (2002). 

[17] Mercurio, Danilo and Spokoiny, Vladimir, em Statistical inference 
for timc-inhomogeneous volatility models., 583, 2000, www.wias- 
berlin.de/publications/preprints/indcx-2000.html 

[18] Stone, C. Optimal global rates of convergence for nonparametric regression, 
Ann. Statist., 10, 1982, pp.1040-1053. 



20 



